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Translation of noncoding common variant association signals 
into meaningful molecular and biological mechanisms explaining 
disease susceptibility remains challenging. For the type 2 di- 
abetes association signal in JAZFl intron 1, we hypothesized that 
the underlying risk variants have c^s-regulatory effects in islets or 
other type 2 diabetes-relevant cell types. We used maps of ex- 
perimentally predicted open chromatin regions to prioritize var- 
iants for functional follow-up studies of transcriptional activity. 
Twelve regions containing type 2 diabetes-associated variants 
were tested for enhancer activity in 832/13 and MING insulinoma 
cells. Three regions exhibited enhancer activity and only 
rs 1635852 displayed allelic differences in enhancer activity; the 
type 2 diabetes risk allele T showed lower transcriptional activity 
than the nonrisk allele C. This risk allele showed increased bind- 
ing to protein complexes, suggesting that it functions as part of 
a transcriptional repressor complex. We applied DNA affinity 
capture to identify factors in the complex and determined that 
the risk allele preferentially binds the pancreatic master regulator 
PDXl. These data suggest that the rsl635852 region in JAZFl 
intron 1 is part of a c^s-regulatory complex and that maps of open 
chromatin are useful to guide identification of variants with alle- 
lic differences in regulatory activity at type 2 diabetes loci. 
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Genome-v^ide association studies (GWAS) have 
identified >50 genome-wide significant loci as- 
sociated with type 2 diabetes to date (1). For 
many of these loci, association signals are lo- 
calized to nonprotein-coding intronic and intergenic 
regions, which may contain variants that regulate gene 
transcription. A primary challenge remains in the transi- 
tion from GWAS locus discovery to identification of func- 
tional variants underlying disease susceptibility. Tools are 
needed to detect functional variants from the set of disease- 
associated variants at a locus. FAIRE-seq (formaldehyde- 
assisted isolation of regulatory elements) and DNase-seq 
are two methods that identify nucleosome-depleted (open 
chromatin) regions comprising active DNA regulatory 
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elements that include promoters, enhancers, silencers, and 
insulators (2,3). Integration of associated variants identi- 
fied through GWAS with tissue-relevant genome-wide 
maps of open chromatin from the ENCODE Consortium 
(4) and epigenomic maps from the Roadmap Epigenomics 
Consortium (5) has great potential to facilitate identifica- 
tion of regulatory variants (6-8). 

JAZFl (juxtaposed with another zinc finger protein) is 
one such locus containing variants strongly associated with 
type 2 diabetes (P = 5 X 10~^^) that are located within in- 
tron 1 (9). JAZFl encodes a putative transcription factor. 
JAZFl protein interacts with NR2C2 (nuclear receptor 
subfamily 2, group C, member 2) protein and represses 
NR2C2-mediated transactivation (10). NR2C2, also known 
as TR4, is an orphan nuclear receptor targeting many genes 
important in metabolism (11). Mice lacking Nr2c2 have 
perinatal and postnatal hypoglycemia (12) and are pro- 
tected from glucose intolerance and insulin resistance in- 
duced with a high-fat diet (13). JAZFl locus variants have 
been associated with impaired p-cell function (14). The 
function of JAZFl as a transcriptional repressor of a gene 
negatively influencing glucose metabohsm suggests that 
susceptibility alleles at this locus may result in decreased 
JAZFl transcription. Of note, islets from type 2 diabetes 
donors display decreased JAZFl expression, and higher 
levels of islet JAZFl are correlated with higher insulin se- 
cretion and higher glycemic control (15). Single nucleotide 
polymorphisms (SNPs) at the JAZFl association signal in 
intron 1 also are associated with height (16), and an in- 
dependent signal = 0.024 with GWAS index SNP 
rs849134) located >200 kb away is associated with prostate 
cancer (17). Additionally, expression quantitative trait locus 
(eQTL) analysis supports association of the type 2 diabetes 
risk allele with altered JAZFl transcript level in adipose 
tissue, liver, and muscle (18-20). 

To gain insight into the molecular mechanisms underlying 
the type 2 diabetes association at JAZFl, we used maps of 
open chromatin to guide identification of variants in potential 
regulatory elements. Based on evidence of an effect in p-cell 
function and insulin secretion (14,15), experiments were per- 
formed in two available mammalian (B-ceU lines. We measured 
transcriptional activity of prioritized variants using luciferase 
reporter assays and report a candidate cis-acting SNP that 
displays aUele-specific enhancer activity. We also evaluated 
allelic differences in protein binding and provide evidence of 
a potential molecular mechanism for JAZFl SNP effects. 

RESEARCH DESIGN AND METHODS 

Selection of SNPs for functional study. Variants were prioritized for 
functional study based on linkage disequilibrium (LD) and evidence of islet 
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open chromatin. All five SNPs in high LD (/ > 0.8; CEPH [Utah residents with 
ancestry from northern and western Europe] [CEU] 1,000G, March 2012 re- 
lease) with the GWAS index SNP rs849134 and present in an islet FAIRE peak 
(8) or DNase peak (6) were tested for evidence of differential transcriptional 
activity. We also tested three SNPs based on only high LD and four other SNPs 
in the region (low LD SNPs; < 0.5; Supplementary Table 1). 
Cell culture. Two insulinoma cell lines, rat-derived 832/13 (21) (C.B. Newgard, 
Duke University, Durham, NC) and mouse-derived M1N6 (22) were maintained 
at 37°C with 5% COg. The 832/13 cells were cultured in RPMl 1640 (Invitrogen, 
Carlsbad, CA) supplemented with 10% FBS, 1 mmol/L sodium pyruvate, 2 
mmol/L i^glutamine, 10 mmol/L HEPES, and 0.05 mmol/L (3-mercaptoethanol. 
M1N6 cells were cultured in DMEM (Invitrogen) supplemented with 10% FBS, 
1 mmol/L sodium pyruvate, and 0.1 mmol/L (3-mercaptoethanol. HepG2 he- 
patocellular carcinoma cells were cultured in MEM-a (Invitrogen) supple- 
mented with 10% FBS, 1 mmol/L sodium pyruvate, and 2 mmol/L L-glutamine. 
Differentiated 3T3-L1 adipocyte cells were maintained in DMEM supple- 
mented with 10% FBS, 1 fxmol/L dexamethasone, 0.5 mmol/L IBMX, and 1 ug/mL 
insulin. 

Generation of luciferase reporter constructs, transient DNA transfection, 
and luciferase reporter assays. The 150- to 200-bp fragments surrounding 
each of 12 SNPs were PCR-amplified (Supplementary Table 2) from DNA of 
individuals homozygous for risk and nonrisk alleles. Restriction sites for 
Kpnl and Xhol were added to primers during amplification, and the resulting 
PCR products were digested with Kpnl and Xhol and cloned in both ori- 
entations into the multiple cloning site of the minimal promoter-containing 
firefly luciferase reporter vector pGL4.23 (Promega, Madison, WI). Frag- 
ments are designated as forward or reverse based on their orientation in 
the genome with respect to the JAZFl coding sequence. Two to four in- 
dependent clones for each allele for each orientation were isolated, verified 
by sequencing, and transfected in duplicate into 832/13 and M1N6 p-cell lines. 

Approximately 1 X 10~^ cells per well were seeded in 24-well plates. At 80% 
confiuency, cells were cotransfected with luciferase constructs and Renilla 
control reporter vector (phRL-TK; Promega) at a ratio of 30:1 for M1N6 using 
Lipofectamine 2000 (Invitrogen) and at a ratio of 10:1 for 832/13 cells using 
FUGENE-6 (Roche Diagnostics, Indianapolis, IN). At 48 h after transfection, 
cells were lysed with passive lysis buffer (Promega), and luciferase activity 
was measured using the Dual-Lucif erase Assay System (Promega). To control 
for transfection efficiency, raw values for firefiy luciferase activity were di- 
vided by raw Renilla luciferase activity values, and fold change was calculated 
as normalized luciferase values divided by pGL4.23 minimal promoter empty 
vector control values. Data are reported as the fold change in mean (±SE) 
relative luciferase activity per allele. A two-sided t test was used to compare 
luciferase activity between alleles. Experiments in M1N6 and 832/13 cells were 
performed on a second independent day and yielded comparable allele- 
specific results. 

Electrophoretic mobility shift assay. Nuclear cell extracts were prepared 
from 832/13, MIN6, HepG2, and 3T3-L1 cells using the NE-PER nuclear and 
cytoplasmic extraction kit (Thermo Scientific) according to the manufacturer's 
instructions. Protein concentration was measured with a BCA protein assay 
(Thermo Scientific), and lysates were stored at -80°C until use. The 17-bp 
oligonucleotides were designed to the sequence surrounding rs 1635852 risk or 
nonrisk alleles as follows: sense 5' biotin-CTGATTAA[T/C]TCACTTAG 3' and 
antisense 5' biotin-CTAAGTGA[G/C]TTAATCAG3' (SNP allele in bold). Double- 
stranded oligonucleotides for the risk and nonrisk alleles were generated 
by incubating 50 pmol complementary oligonucleotides at 95° C for 5 min, 
followed by gradual cooling to room temperature. Electrophoretic mobility 
shift assays (EMSAs) were performed using the LightShift Chemiluminescent 
EMSA Kit (Thermo Scientific). Binding reactions were set-up as ix binding 
buffer, 50 ng/jjuL poly (dl dC), 3 jjLg nuclear extract, and 20 fmol labeled probe 
in a final volume of 20 jxL. For competition reactions, 67-fold excess of un- 
labeled double-stranded oligonucleotides for either the risk or the nonrisk 
allele were included. Reactions were incubated at room temperature for 25 
min. For supershift assays, 4 jjig polyclonal antibodies against PDXl (SC- 
14662 X; Santa Cruz Biotechnology) or CUXl (SC6327X; Santa Cruz Bio- 
technology) were added to the binding reaction and incubation proceeded 
for a further 25 min. Binding reactions were subjected to nondenaturing 
PAGE on DNA retardation gels in 0.5 X TBE (Invitrogen), transferred to nylon 
membranes (Invitrogen), and cross-linked on an ultraviolet light cross-linker 
(Stratagene). Biotin labeled DNA-protein complexes were detected by 
chemiluminescence. EMSAs were performed on a second independent day 
and yielded comparable results. 

DNA affinity capture assay. Nuclear extracts (prepared as for EMSA) were 
dialyzed against dialysis buffer (20 mmol/L Tris/HCl [pH 7.9], 70 mmol/L KCl, 
1 mmol/L EDTA) in a Slide-A-Lyzer MINI Dialysis Device (Thermo Scientific). 
Dialyzed nuclear extracts (300 jjig) were precleared with 100 jjlL streptavidin- 
agarose dynabeads (Invitrogen) coupled to biotin-labeled scrambled control 
oligonucleotides. This preclearing step was performed to reduce nonspecific 



binding of nuclear protein. For DNA-protein binding reactions, 40 pmol of 
biotin-labeled probe either for rs 1635852 allele (same probe as for EMSA) or 
for a scrambled control were incubated with 300 fxg nuclear extract, binding 
buffer (10 mmol/L Tris, 50 mmol/L KCL, 1 mmol/L DTT), 0.5 fxg/fxL poly 
(dl - dC), and H2O to total 450 fxL at room temperature for 30 min with rotation; 
100 |jlL (1 mg) of streptavidin-agarose dynabeads were added and the reaction 
was incubated for a further 20 min. Beads were washed four times in binding 
buffer with 0.05% NP-40 and once in binding buffer without NP-40. DNA-bound 
proteins were eluted in 1 X reducing sample buffer (Invitrogen) by heating for 
10 min at 70°C. Proteins were separated on NuPAGE denaturing gels and 
protein bands were stained with SYPRO-Ruby. Protein bands displaying dif- 
ferential binding between rs 1635852 alleles (but not scrambled control) were 
excised from the gel and subjected to matrix-assisted laser desorption time-of- 
fiight/time-of-fiight tandem mass spectrometry (MS) and analysis at the Uni- 
versity of North Carolina proteomics core facility. For peptide identification, 
all MS/MS spectra were searched against all entries in the National Center for 
Biotechnology Information nonredundant database using GPS Explorer soft- 
ware version 3.6 (ABI) and Mascot (MatrixScience) search algorithm. Mass 
tolerance was 80 ppm for precursor ions and 0.6 Da for fragment ions were 
used. In addition, two missed cleavages were allowed and oxidation of me- 
thionine was a variable modification. 



RESULTS 

Characterization of type 2 diabetes-associated SNPs 
in JAZFl intron 1 with regulatory potential. To dis- 
tinguish potentially functional regulatory variants from 
proxy variants in high LD at the type 2 diabetes-associated 
JAZFl locus, we selected variants in high LD (r^ > 0.8) 
with GWAS index SNP rs849134. To further prioritize 
variants for functional analysis, we used genome-wide 
maps of open chromatin (Fig. L4, C) in available type 2 
diabetes-relevant cell types, including pancreatic islets, 
liver hepatocytes, and skeletal muscle myotubes. DNase- 
seq and FAIRE-seq are well-established methods identify- 
ing both overlapping and unique nucleosome-depleted 
regions that include active regulatory elements (23). We 
also evaluated variant position with respect to the histone 
modifications H3K4mel and H3K9ac, which are post- 
translational marks associated with enhancer regions (Fig. 
ID) (24,25). Of 15 variants meeting the LD threshold, five 
SNPs were found to overlap an islet FAIRE peak or DNase 
peak and displayed some evidence of H3K4mel or H3K9ac 
signal (Fig. 1). Whereas enhancer histone modification 
profiles served to support evidence of function, they did 
not help for SNP prioritization because of an overall high 
background signal (Fig. ID). No SNPs overlapped with 
DNase or FAIRE peaks in the other cell types examined 
(data not shown). 

rsl635852 displays allele-specific enhancer activity in 
islet cells. To evaluate transcriptional activity of the five 
SNPs predicted to be in islet-regulatory regions, we cloned 
—200 bp surrounding each SNP allele into a minimal pro- 
moter reporter vector and measured luciferase activity in 
two p-cell lines, 832/13 rat insulinoma and M1N6 mouse 
insulinoma cells. Two to four independent clones for each 
allele were generated and enhancer activity was measured 
in duplicate for each clone. Of the five SNPs in open 
chromatin region, three displayed evidence of enhancer 
activity compared with an empty vector control in both 
cell lines in the forward orientations -rsl635852, rs849133, 
and rs849142 (Fig. 2A, B). Of these, rsl635852 showed 
differential allelic enhancer activity in both orientations in 
both cell lines. The risk allele T showed significantly de- 
creased luciferase activity compared with the nonrisk al- 
lele C (forward: 832/13 [P = 7.8 X 10"^] and MIN6 [P = 1.0 X 
10^^]; reverse: 832/13 [P = 1.8 X 10"^] and MIN6 [P = 8.3 X 
10~^]; Fig. 2A, B). Enhancer activity at rsl635852 represents 
> 2-fold and 1.7-fold increases in transcriptional activity 
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FIG. 1. Regulatory potential at type 2 diabetes-associated SNPs at the JAZFI locus. A: Twelve high LD SNPs (r^ ^ 0.80 with GWAS index SNP 
rs849134). Closed arrows indicate five SNPs overlapping open chromatin marks tested for allele-specific transcriptional activity. Open arrows 
indicate three SNPs without evidence of open chromatin and tested for allele-specific transcriptional activity for comparison. Four high LD SNPs 
without evidence of open chromatin were not tested for allele-specific transcriptional activity. B: FAIRE peaks identified in three islet samples. 
C: DNase hypersensitivity peaks identified in two pooled islet samples from the ENCODE Consortium. Di Islet H3K4mel and H3K9ac histone 
modifications from the Roadmap Epigenomics Consortium. Three high LD indels that do not overlap with open chromatin and lack reference SNP 
ID numbers (rs#) are shown. Four additional low LD SNPs located 12-20 kb proximal to the region shown were tested for allele-specific tran- 
scriptional activity. Image is taken from University of California, Santa Cruz, genome browser, February 2009 (GRCh37/hgl9) assembly (http:// 
genome.ucsc.edu) (38). 



relative to the risk allele in forward and reverse orienta- 
tion, respectively, in both cell lines. No allele-specific en- 
hancer activity was observed for rs849133 and rs849142. 
Transcriptional activity was evaluated in seven other type 
2 diabetes-associated SNPs in JAZFI intron 1 not over- 
lapping islet FAIRE or DNase peaks, three SNPs in high LD 
and four SNPs in lower LD, with GWAS index SNP 
rs849134 (Supplementary Table 1). None of these seven 
SNPs showed evidence of enhancer activity in 832/13 or 
MING cells compared with an empty vector control (Sup- 
plementary Fig. 1). Taken together, these data demon- 
strate that rs 1635852 exhibits allelic differences in 
transcriptional enhancer activity and suggest it functions 
within a candidate cis-regulatory element at the JAZFI 
intronic type 2 diabetes-associated locus. 
Differential protein binding to rsl635852 alleles. We 
next asked whether alleles of rs 1635852 differentially af- 
fect DNA binding to nuclear proteins. Biotin-labeled 
probes surrounding the T (risk) or C (nonrisk) allele of 
rsl635852 were incubated with 832/13 or MIN6 nuclear 
lysate and subjected to EMSA. Band shifts indicative of 
multiple DNA-protein complexes were observed for both 
alleles of rsl635852 (Fig. 3A, C, D). In the EMSA from 832/ 
13 nuclear extract, three protein complexes were observed 
for the probe containing the T allele that were either less 
intense (arrow a) or not present (arrow b, c) for the probe 
containing the C allele. In the EMSA from MIN6 nuclear 
extract, two protein complexes were observed for the 
probe containing the T allele that were either less intense 
(arrow a) or not present (arrow b) for the probe containing 
the C allele. Taken together, these data suggest differential 
protein binding dependent on the rs 1635852 allele. Com- 
petition of labeled T allele with excess unlabeled T allele 
more efficiently competed away allele-specific bands than 
excess unlabeled C allele, demonstrating allele-specificity 
of the protein-DNA complexes (Fig. 3A). Based on these 
results, we hypothesized that rsl635852 is located in 
a binding site for a transcriptional regulator complex that 
may be disrupted by the rsl635852 C allele. 



Identification of proteins binding rsl635852. We 

sought to identify factors in the protein complex binding 
rs 1635852 using a DNA-affinity capture assay. The same 
biotin-labeled probes used for EMSA including the 
rsl635852 T or C alleles were incubated with 832/13 nu- 
clear lysates, and the resulting DNA-protein complexes 
were isolated and subjected to SDS-PAGE. We observed 
two protein bands showing differential intensity consistent 
with increased binding to the T allele (Fig. 35). These 
bands were identified as transcription factors PDXl (pan- 
creatic duodenal homeobox 1) and CUXl (cut-like ho- 
meobox 1) using MALDI TOF/TOF MS. 

To validate binding of rsl635852 to these transcription 
factors, we performed supershift experiments incubating 
DNA-protein complexes with antibodies for either PDXl 
or CUXl. Incubation of the T allele-protein complex with 
PDXl antibody completely disrupted protein-DNA binding 
at complex c (832/13; Fig. 3C, arrow c) and reduced 
binding to protein complex b (Fig. 3C, 832/13; Fig. 3Z), 
MIN6, arrow b). A PDXl-mediated supershift was ob- 
served in MIN6 cells for both rsl635852 alleles (Fig. 3Z)). 
A consistent supershift band was not observed in 832/13 
cells and may have been masked by presence of a larger 
DNA-protein complex (Fig. 3C, arrow d). No evidence of 
a PDX-mediated complex disruption or band supershift 
was observed in 3T3-L1 and HepG2 nuclear extracts 
(Supplementary Fig. 2A, E). In contrast, incubation with 
CUXl only slightly reduced binding to complex b and c 
(Fig. 3C, D). These data provide evidence that the 
rsl635852 T allele binds PDXl in an allele-specific manner. 
Consistent with this result, a search of the JASPAR CORE 
database (26) shows that only the sequence containing the 
rsl635852 T allele is predicted as a PDXl consensus core- 
binding motif. 

DISCUSSION 

At many of the loci identified through GWAS, association 
signals are localized to intronic and intergenic regions and 
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FIG. 2. rsl635852 alleles display differential transcriptional activity. 
A: Enhancer activity was tested in 832/13 cells for the type 2 diabetes 
risk (white bars) and nonrisk (black bars) alleles of five SNPs in can- 
didate regulatory regions in forward (Fwd) and reverse (Rev) ori- 
entations with respect to JAZFl. Significant allele-specific enhancer 
activity was observed for rsl635852. The rsl635852 risk allele T shows 
less transcriptional activity than the nonrisk allele C in both ori- 
entations with respect to a minimal promoter vector. B: rsl635852 risk 
allele displays similar decreased transcriptional activity in MIN6 cells. 
Error bars represent SE of 2-4 independent clones for each allele. 
Results are expressed as fold change compared with empty vector 
control. P values were calculated by a two-sided t test. 



are hypothesized to harbor nonpromoter regulatory ele- 
ments altering gene transcription. In the current study, we 
focused on the type 2 diabetes-associated signal in JAZFl 
intron 1 and asked whether variants at this locus displayed 
allele-specific transcriptional enhancer activity consistent 
with a c^s-regulatory effect. We used maps of open chro- 
matin to guide identification of variants with allelic differ- 
ences in transcriptional activity. We provide evidence that 
rsl635852 is a strong candidate for a differential effect on 
transcriptional enhancer activity, likely through altered 
binding in a regulatory complex containing PDXl. SNPs 
evaluated include the JAZFl lead SNPs from two recent 
descriptions of high-density genotyping at this locus (27,28). 

A challenge in mechanistic studies of GWAS signals is 
differentiating among the numerous SNPs to identify those 
underlying the disease association. On the basis of the 
assumption that a common variant with modest effect 
likely underlies the association at JAZFl, we aligned high 
LD variants (r^ > 0.8; n = 14) in JAZFl intron 1 with active 
DNA regulatory elements identified by DNase-seq and 
FAIRE-seq and found that five variants overlapped with 
islet-regulatory regions. It is important to recognize that 
current available data sets for FAIRE and DNase may not 



be complete and that SNPs in lower LD (r^ < 0.8) with the 
index SNP also may affect gene expression or activity. 
Therefore, additional functional SNPs may exist at the 
JAZFl type 2 diabetes locus that were not examined in 
this study. 

Notably, all five SNPs found in islet open chromatin 
regions overlapped with FAIRE peaks only, perhaps 
refiecting a recent observation of twice as many FAIRE 
sites as DNase sites when looking across multiple cell 
types (23). Cell-selective open chromatin tends to be lo- 
cated away from the transcription start site (23), suggest- 
ing that regulatory elements in intron 1 of JAZFl may be 
unique to islets. There was no evidence of overlap with 
open chromatin for associated SNPs in other available 
relevant tissue datasets examined. However, we cannot 
rule out that there may be shared regulatory regions in 
tissues for which FAIRE or DNase data are not currently 
available. 

Choosing the correct tissue type for testing activity of 
functional variants is critical because many regulatory 
elements act in a tissue-specific manner (25,29). Based on 
our observation of type 2 diabetes-associated SNPs in 
regions of islet open chromatin, we measured transcrip- 
tional activity in two available mammalian islet cell mod- 
els, 832/13 and MING cells from rat and mouse, 
respectively. Importantly, we found similar fold changes in 
allelic transcriptional activity across the two cell types, 
suggesting that measurement of enhancer activity may be 
consistent across species. 

Of three SNPs predicted to be located in active regula- 
tory regions that displayed enhancer activity, only 
rs 1635852 demonstrated allele-specific effects, making it 
a lead functional candidate among the SNPs tested. The T 
allele (risk) of rs 1635852 displayed reduced enhancer ac- 
tivity relative to the C allele (nonrisk), suggesting that re- 
duced expression of islet JAZFl may be associated with 
type 2 diabetes. We hypothesized that alleles of rsl635852 
might differentially bind a transcription factor, resulting in 
altered gene transcription, and our analysis of protein 
binding revealed complexes that favored the rs 1635852 T 
allele in 832/13 and MIN6 insulinoma cells. rsl635852 also 
showed a differential protein binding pattern in EMSA 
using 3T3-L1 mouse adipocyte and HepG2 human hepa- 
tocyte nuclear extracts but the pattern of binding differed 
from that of islet extracts with alternate DNA-protein 
complexes present (Supplementary Fig. 2A, E). Using DNA 
affinity capture, we identified PDXl and CUXl as two 
proteins that showed increased binding to the T allele and 
validated that in 832/13 and MIN6 cells, PDXl antibody 
disrupted at least one of the protein complexes formed 
preferentially in the presence of the T allele. Our results 
suggest that the DNA sequence surrounding rsl635852 
may bind protein differentially in multiple cell types but 
that the protein complexes involved likely differ because 
PDXl binding was exclusive to insulinoma cell types. The 
fourth base of the PDXl consensus core DNA binding 
motif (TAAT) is altered by the rsl635852 C allele. In con- 
trast, the CUXl consensus sequence, ATA (30), is not 
found in the 17 bp DNA sequence that includes rsl635852, 
suggesting that CUXl may not directly bind DNA in this 
region, but instead may be bound to a DNA-binding protein 
in the complex. Additional transcription factors not iden- 
tified here also may play a role in enhancer activity at 
rsl635852. 

PDXl plays a central role in embryonic development of 
pancreatic islets and in maintenance of normal glucose 
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FIG. 3. Alleles of rsl635852 differentially bind PDXl in rat 832/13 and mouse MIN6 insulinoma cells. A: EMSA using 832/13 nuclear extract shows 
differential protein-DNA binding of rsl635852 alleles. The probe containing risk allele T shows increased protein binding (arrows a, b, c) com- 
pared with the probe containing nonrisk allele C. Excess unlabeled specific probe containing the T allele (T-comp) more efficiently competed away 
allele-specific bands than unlabeled probe for the C allele (C-comp). To enhance visualization of protein complexes, free biotin-labeled probe is 
not shown. B: DNA affinity-capture identified differential binding of CUXl and PDXl at rsl635852 alleles in 832/13 cells. C: Incubation of 832/13 
nuclear extract with PDXl antibody disrupts the DNA-protein complex formed with T allele-containing DNA probe (arrows b and c). The presence 
of a nonallele-specific complex (arrow d) may mask a PDXl-mediated supershift. D: Incubation of MIN6 nuclear extract with PDXl antibody 
disrupts the DNA-protein complex formed with T allele-containing DNA probe (arrow b) and results in a band supershift. (A high-quality color 
representation of this figure is available in the online issue.) 



homeostasis in adult islets. Heterozygous mutations in 
PDXl result in early and late onset of diabetes in humans 
(31-33). PDXl binds to DNA sequences containing the 
consensus TAAT motif and subsequently recruits protein 
complexes that regulate transcription of several (B-ceU 
genes that include INS and GLUT2 (34,35). Whereas the 
role of PDXl as a gene activator is well-established, a recent 
analysis of genome-wide PDXl occupancy using chromatin 
immunoprecipitation highlights a strong role for PDXl as 
a transcriptional repressor (36). The same study demon- 
strates that PDXl binds across 4,470 human genes, in- 
cluding to five regions of JAZFl, with 48% of binding sites 
being located in introns (36). Our data demonstrate reduced 
transcriptional activity with the rsl635852 T aUele, sug- 
gesting that in this instance PDXl may be functioning as 
part of a transcriptional repressor complex. Further 
experiments are necessary to validate binding to rs 1635852 
in human islets with known genotypes and to elucidate 
other key factors in the repressor complex. 

Although JAZFl is not confirmed as the affected gene at 
this locus, eQTL analysis supports association of the type 2 
diabetes risk allele with decreased JAZFl transcript levels in 
adipose tissue that is consistent with the direction of 



transcriptional activity we observed in islet cells (18). 
rs 1635852 or proxy SNPs (f > 0.95) are reported associated 
with JAZFl transcript levels in at least three tissues — Oliver, 
adipose, and muscle (18-20). In addition, JAZFl expression 
is decreased in islets from type 2 diabetes donors (15). The 
strongest evidence of a biological basis for JAZFl in type 2 
diabetes susceptibility comes from observations that JAZFl 
protein binds and represses action of the metabolic regulator 
NR2C2 (10). A^r^c^-deficient mice show reduced hepatic 
triglyceride levels, reduced lipid accumulation in adipose 
tissue, and are resistant to glucose intolerance and insuMn 
resistance (13). NR2C2 is expressed in pancreatic islets (37), 
but to our knowledge it has not been weU-characterized. Our 
data suggesting differential allelic transcription in JAZFl 
may aUude to a role for NR2C2 in (B-ceU gene regulation. 
Recent chromatin immunoprecipitation analysis of NR2C2 
binding sites across the genome revealed that NR2C2 con- 
trols genes involved in fundamental biologic processes 
across diverse ceU types and in targeting genes in a ceU- 
specific manner (11). It is also possible that JAZFl may serve 
as a cofactor of additional nuclear receptors, although it does 
not regulate transcriptional activation by two tested recep- 
tors, PPARA and RORG (10). 
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In summary, we demonstrate that maps of open chro- 
matin are a useful resource to guide identification of var- 
iants in cis-acting regulatory elements at type 2 diabetes 
susceptibility loci, and we provide evidence that rsl635852, 
a SNP at the JAZFl type 2 diabetes-associated locus, dif- 
ferentially affects transcriptional activity through binding of 
a regulatory protein complex that includes PDXl. 
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